Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 8760 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 MiB |
| Average record size in memory | 131.0 B |
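The figures above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming the dataset holds one row per hour for 365 days (consistent with the 365 distinct dates and 24 distinct hours reported for the individual variables); the variable names are illustrative only.

```python
# Illustrative sanity check of the dataset statistics; names are hypothetical.
n_observations = 8760
n_variables = 19
avg_record_bytes = 131.0  # "Average record size in memory"

# 365 days x 24 hourly records per day should give the observation count.
assert 365 * 24 == n_observations

# Total size = rows x average record size, converted from bytes to MiB.
total_mib = n_observations * avg_record_bytes / 2**20
print(f"{total_mib:.2f} MiB")

# Number of cells that the "Missing cells" count is measured against.
n_cells = n_observations * n_variables
print(n_cells)
```

Rounded to one decimal place, the computed size agrees with the reported 1.1 MiB.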
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 6 |
| Boolean | 3 |
date has a high cardinality: 365 distinct values | High cardinality |
wdsp is highly correlated with wind_bft | High correlation |
month is highly correlated with year | High correlation |
year is highly correlated with month | High correlation |
dayofweek_n is highly correlated with working_day | High correlation |
working_day is highly correlated with dayofweek_n | High correlation |
wind_bft is highly correlated with wdsp | High correlation |
rain is highly correlated with year and 3 other fields | High correlation |
temp is highly correlated with year and 3 other fields | High correlation |
rhum is highly correlated with year and 3 other fields | High correlation |
wdsp is highly correlated with year and 3 other fields | High correlation |
hour is highly correlated with year and 3 other fields | High correlation |
day is highly correlated with year and 3 other fields | High correlation |
month is highly correlated with year and 3 other fields | High correlation |
year is highly correlated with rain and 9 other fields | High correlation |
holiday is highly correlated with rain and 9 other fields | High correlation |
dayofweek_n is highly correlated with working_day and 1 other fields | High correlation |
working_day is highly correlated with rain and 9 other fields | High correlation |
peak is highly correlated with rain and 9 other fields | High correlation |
dayofweek is highly correlated with working_day | High correlation |
year is highly correlated with season | High correlation |
working_day is highly correlated with dayofweek | High correlation |
season is highly correlated with year | High correlation |
rain is highly correlated with rainfall_intensity | High correlation |
temp is highly correlated with month and 1 other fields | High correlation |
rhum is highly correlated with hour | High correlation |
wdsp is highly correlated with wind_bft | High correlation |
hour is highly correlated with rhum and 3 other fields | High correlation |
month is highly correlated with temp and 2 other fields | High correlation |
year is highly correlated with month and 1 other fields | High correlation |
dayofweek_n is highly correlated with dayofweek and 1 other fields | High correlation |
dayofweek is highly correlated with dayofweek_n and 1 other fields | High correlation |
working_day is highly correlated with dayofweek_n and 2 other fields | High correlation |
season is highly correlated with temp and 2 other fields | High correlation |
peak is highly correlated with hour and 2 other fields | High correlation |
timesofday is highly correlated with hour and 2 other fields | High correlation |
rainfall_intensity is highly correlated with rain | High correlation |
wind_bft is highly correlated with wdsp | High correlation |
count is highly correlated with hour and 1 other fields | High correlation |
date is uniformly distributed | Uniform |
rain has 7862 (89.7%) zeros | Zeros |
hour has 365 (4.2%) zeros | Zeros |
dayofweek_n has 1272 (14.5%) zeros | Zeros |
count has 1794 (20.5%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-24 12:49:14.464220 |
|---|---|
| Analysis finished | 2022-04-24 12:50:00.315539 |
| Duration | 45.85 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
rain
| Distinct | 48 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06842465753 |
| Minimum | 0 |
| Maximum | 10.3 |
| Zeros | 7862 |
| Zeros (%) | 89.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.3 |
| Maximum | 10.3 |
| Range | 10.3 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3533969449 |
|---|---|
| Coefficient of variation (CV) | 5.164760155 |
| Kurtosis | 155.9776871 |
| Mean | 0.06842465753 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.03913757 |
| Sum | 599.4 |
| Variance | 0.1248894006 |
| Monotonicity | Not monotonic |
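The descriptive statistics are tied together by simple identities: mean = sum / n, variance = (standard deviation)², and CV = standard deviation / mean. The sketch below re-derives three of the values in the table above from the reported sum and standard deviation, assuming n = 8760 observations as stated in the dataset statistics.

```python
n = 8760              # number of observations (from the dataset statistics)
total = 599.4         # reported Sum
std = 0.3533969449    # reported Standard deviation

mean = total / n      # should match the reported Mean
variance = std ** 2   # should match the reported Variance
cv = std / mean       # should match the reported Coefficient of variation

print(mean, variance, cv)
```

A CV above 5 reflects how strongly the few non-zero values dominate the spread of a column that is zero almost 90% of the time.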
Histogram with fixed size bins (bins=48)
| Value | Count | Frequency (%) |
| 0 | 7862 | |
| 0.1 | 272 | 3.1% |
| 0.2 | 112 | 1.3% |
| 0.3 | 81 | 0.9% |
| 0.4 | 61 | 0.7% |
| 0.6 | 50 | 0.6% |
| 0.5 | 39 | 0.4% |
| 0.7 | 36 | 0.4% |
| 0.9 | 27 | 0.3% |
| 0.8 | 26 | 0.3% |
| Other values (38) | 194 | 2.2% |
| Value | Count | Frequency (%) |
| 0 | 7862 | |
| 0.1 | 272 | 3.1% |
| 0.2 | 112 | 1.3% |
| 0.3 | 81 | 0.9% |
| 0.4 | 61 | 0.7% |
| 0.5 | 39 | 0.4% |
| 0.6 | 50 | 0.6% |
| 0.7 | 36 | 0.4% |
| 0.8 | 26 | 0.3% |
| 0.9 | 27 | 0.3% |
| Value | Count | Frequency (%) |
| 10.3 | 1 | |
| 6.1 | 1 | |
| 5.5 | 2 | |
| 5.2 | 1 | |
| 5.1 | 2 | |
| 4.9 | 1 | |
| 4.7 | 1 | |
| 4.6 | 1 | |
| 4.5 | 1 | |
| 4.4 | 1 |
temp
| Distinct | 293 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.18797945 |
| Minimum | -4.5 |
| Maximum | 26.3 |
| Zeros | 12 |
| Zeros (%) | 0.1% |
| Negative | 137 |
| Negative (%) | 1.6% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | -4.5 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6.6 |
| median | 10 |
| Q3 | 13.9 |
| 95-th percentile | 18.3 |
| Maximum | 26.3 |
| Range | 30.8 |
| Interquartile range (IQR) | 7.3 |
Descriptive statistics
| Standard deviation | 5.036550324 |
|---|---|
| Coefficient of variation (CV) | 0.4943620418 |
| Kurtosis | -0.3417585134 |
| Mean | 10.18797945 |
| Median Absolute Deviation (MAD) | 3.6 |
| Skewness | 0.0770660156 |
| Sum | 89246.7 |
| Variance | 25.36683916 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 10.6 | 83 | 0.9% |
| 9.2 | 83 | 0.9% |
| 8.7 | 81 | 0.9% |
| 8 | 81 | 0.9% |
| 8.9 | 80 | 0.9% |
| 10.1 | 80 | 0.9% |
| 13.2 | 79 | 0.9% |
| 10.7 | 79 | 0.9% |
| 7.8 | 78 | 0.9% |
| 9.3 | 77 | 0.9% |
| Other values (283) | 7959 |
| Value | Count | Frequency (%) |
| -4.5 | 1 | < 0.1% |
| -4 | 1 | < 0.1% |
| -3.9 | 1 | < 0.1% |
| -3.4 | 2 | |
| -3.3 | 1 | < 0.1% |
| -3.2 | 2 | |
| -3 | 1 | < 0.1% |
| -2.9 | 3 | |
| -2.8 | 1 | < 0.1% |
| -2.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 26.3 | 3 | |
| 26.2 | 1 | < 0.1% |
| 25.9 | 1 | < 0.1% |
| 25.7 | 2 | |
| 25.6 | 1 | < 0.1% |
| 25.4 | 3 | |
| 25.3 | 2 | |
| 25.2 | 1 | < 0.1% |
| 25.1 | 2 | |
| 25 | 1 | < 0.1% |
rhum
| Distinct | 69 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82.33047945 |
| Minimum | 24 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 24 |
|---|---|
| 5-th percentile | 59 |
| Q1 | 75 |
| median | 84 |
| Q3 | 91 |
| 95-th percentile | 98 |
| Maximum | 100 |
| Range | 76 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 11.67070283 |
|---|---|
| Coefficient of variation (CV) | 0.1417543406 |
| Kurtosis | 0.4806764447 |
| Mean | 82.33047945 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.8465907315 |
| Sum | 721215 |
| Variance | 136.2053045 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 89 | 348 | 4.0% |
| 88 | 333 | 3.8% |
| 90 | 328 | 3.7% |
| 91 | 327 | 3.7% |
| 87 | 319 | 3.6% |
| 94 | 304 | 3.5% |
| 95 | 302 | 3.4% |
| 82 | 300 | 3.4% |
| 92 | 296 | 3.4% |
| 84 | 295 | 3.4% |
| Other values (59) | 5608 |
| Value | Count | Frequency (%) |
| 24 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 39 | 2 | |
| 40 | 4 | |
| 41 | 3 |
| Value | Count | Frequency (%) |
| 100 | 171 | |
| 99 | 116 | 1.3% |
| 98 | 167 | |
| 97 | 203 | |
| 96 | 248 | |
| 95 | 302 | |
| 94 | 304 | |
| 93 | 295 | |
| 92 | 296 | |
| 91 | 327 |
wdsp
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.635502283 |
| Minimum | 1 |
| Maximum | 35 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5 |
| median | 8 |
| Q3 | 11 |
| 95-th percentile | 17 |
| Maximum | 35 |
| Range | 34 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.445983842 |
|---|---|
| Coefficient of variation (CV) | 0.5148494779 |
| Kurtosis | 1.55646894 |
| Mean | 8.635502283 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.007973409 |
| Sum | 75647 |
| Variance | 19.76677232 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=33)
| Value | Count | Frequency (%) |
| 6 | 911 | |
| 7 | 893 | |
| 5 | 801 | |
| 8 | 789 | |
| 9 | 694 | 7.9% |
| 4 | 677 | 7.7% |
| 10 | 650 | 7.4% |
| 11 | 556 | 6.3% |
| 3 | 486 | 5.5% |
| 12 | 429 | 4.9% |
| Other values (23) | 1874 |
| Value | Count | Frequency (%) |
| 1 | 57 | 0.7% |
| 2 | 257 | 2.9% |
| 3 | 486 | |
| 4 | 677 | |
| 5 | 801 | |
| 6 | 911 | |
| 7 | 893 | |
| 8 | 789 | |
| 9 | 694 | |
| 10 | 650 |
| Value | Count | Frequency (%) |
| 35 | 2 | < 0.1% |
| 33 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 6 | |
| 29 | 4 | < 0.1% |
| 28 | 7 | |
| 27 | 6 | |
| 26 | 4 | < 0.1% |
| 25 | 8 | |
| 24 | 10 |
date
| Distinct | 365 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.6 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021-03-01 |
|---|---|
| 2nd row | 2021-03-01 |
| 3rd row | 2021-03-01 |
| 4th row | 2021-03-01 |
| 5th row | 2021-03-01 |
Common Values
| Value | Count | Frequency (%) |
| 2021-03-01 | 24 | 0.3% |
| 2021-11-07 | 24 | 0.3% |
| 2021-11-05 | 24 | 0.3% |
| 2021-11-04 | 24 | 0.3% |
| 2021-11-03 | 24 | 0.3% |
| 2021-11-02 | 24 | 0.3% |
| 2021-11-01 | 24 | 0.3% |
| 2021-10-31 | 24 | 0.3% |
| 2021-10-30 | 24 | 0.3% |
| 2021-10-29 | 24 | 0.3% |
| Other values (355) | 8520 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| 2021-03-01 | 24 | 0.3% |
| 2021-03-24 | 24 | 0.3% |
| 2021-03-03 | 24 | 0.3% |
| 2021-03-04 | 24 | 0.3% |
| 2021-03-05 | 24 | 0.3% |
| 2021-03-06 | 24 | 0.3% |
| 2021-03-07 | 24 | 0.3% |
| 2021-03-08 | 24 | 0.3% |
| 2021-03-09 | 24 | 0.3% |
| 2021-03-10 | 24 | 0.3% |
| Other values (355) | 8520 |
hour
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.5 |
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 365 |
| Zeros (%) | 4.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5.75 |
| median | 11.5 |
| Q3 | 17.25 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 11.5 |
Descriptive statistics
| Standard deviation | 6.922581688 |
|---|---|
| Coefficient of variation (CV) | 0.6019636251 |
| Kurtosis | -1.204176265 |
| Mean | 11.5 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0 |
| Sum | 100740 |
| Variance | 47.92213723 |
| Monotonicity | Not monotonic |
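Because each of the 24 distinct values from 0 to 23 occurs 365 times, this column can be rebuilt exactly and the statistics above rechecked. The sketch below uses Python's standard `statistics` module. It assumes linearly interpolated quantiles and the sample (n − 1) variance, which is the combination that reproduces the reported figures here; it is not taken from the report's own configuration.

```python
import math
import statistics

# Rebuild the column: each hour 0..23 appears once per day for 365 days.
hours = sorted(h for h in range(24) for _ in range(365))

# Quartiles with linear interpolation between the two nearest data points.
q1, median, q3 = statistics.quantiles(hours, n=4, method="inclusive")
iqr = q3 - q1

mean = statistics.mean(hours)
variance = statistics.variance(hours)  # sample variance (divides by n - 1)
std = math.sqrt(variance)

print(q1, median, q3, iqr)
print(mean, variance, std)
```

The fractional quartiles (5.75 and 17.25) come from interpolating between neighbouring integer hours, not from any fractional value in the data.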
Histogram with fixed size bins (bins=24)
| Value | Count | Frequency (%) |
| 0 | 365 | 4.2% |
| 1 | 365 | 4.2% |
| 22 | 365 | 4.2% |
| 21 | 365 | 4.2% |
| 20 | 365 | 4.2% |
| 19 | 365 | 4.2% |
| 18 | 365 | 4.2% |
| 17 | 365 | 4.2% |
| 16 | 365 | 4.2% |
| 15 | 365 | 4.2% |
| Other values (14) | 5110 |
| Value | Count | Frequency (%) |
| 0 | 365 | |
| 1 | 365 | |
| 2 | 365 | |
| 3 | 365 | |
| 4 | 365 | |
| 5 | 365 | |
| 6 | 365 | |
| 7 | 365 | |
| 8 | 365 | |
| 9 | 365 |
| Value | Count | Frequency (%) |
| 23 | 365 | |
| 22 | 365 | |
| 21 | 365 | |
| 20 | 365 | |
| 19 | 365 | |
| 18 | 365 | |
| 17 | 365 | |
| 16 | 365 | |
| 15 | 365 | |
| 14 | 365 |
day
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.72054795 |
| Minimum | 1 |
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.796749115 |
|---|---|
| Coefficient of variation (CV) | 0.5595701337 |
| Kurtosis | -1.193150834 |
| Mean | 15.72054795 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.007522437488 |
| Sum | 137712 |
| Variance | 77.382795 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=31)
| Value | Count | Frequency (%) |
| 1 | 288 | 3.3% |
| 2 | 288 | 3.3% |
| 28 | 288 | 3.3% |
| 27 | 288 | 3.3% |
| 26 | 288 | 3.3% |
| 25 | 288 | 3.3% |
| 24 | 288 | 3.3% |
| 23 | 288 | 3.3% |
| 22 | 288 | 3.3% |
| 21 | 288 | 3.3% |
| Other values (21) | 5880 |
| Value | Count | Frequency (%) |
| 1 | 288 | |
| 2 | 288 | |
| 3 | 288 | |
| 4 | 288 | |
| 5 | 288 | |
| 6 | 288 | |
| 7 | 288 | |
| 8 | 288 | |
| 9 | 288 | |
| 10 | 288 |
| Value | Count | Frequency (%) |
| 31 | 168 | |
| 30 | 264 | |
| 29 | 264 | |
| 28 | 288 | |
| 27 | 288 | |
| 26 | 288 | |
| 25 | 288 | |
| 24 | 288 | |
| 23 | 288 | |
| 22 | 288 |
month
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.526027397 |
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.448048134 |
|---|---|
| Coefficient of variation (CV) | 0.5283533035 |
| Kurtosis | -1.207055959 |
| Mean | 6.526027397 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.01045819518 |
| Sum | 57168 |
| Variance | 11.88903593 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) |
| 3 | 744 | |
| 5 | 744 | |
| 7 | 744 | |
| 8 | 744 | |
| 10 | 744 | |
| 12 | 744 | |
| 1 | 744 | |
| 4 | 720 | |
| 6 | 720 | |
| 9 | 720 | |
| Other values (2) | 1392 |
| Value | Count | Frequency (%) |
| 1 | 744 | |
| 2 | 672 | |
| 3 | 744 | |
| 4 | 720 | |
| 5 | 744 | |
| 6 | 720 | |
| 7 | 744 | |
| 8 | 744 | |
| 9 | 720 | |
| 10 | 744 |
| Value | Count | Frequency (%) |
| 12 | 744 | |
| 11 | 720 | |
| 10 | 744 | |
| 9 | 720 | |
| 8 | 744 | |
| 7 | 744 | |
| 6 | 720 | |
| 5 | 744 | |
| 4 | 720 | |
| 3 | 744 |
year
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.6 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2021 |
|---|---|
| 2nd row | 2021 |
| 3rd row | 2021 |
| 4th row | 2021 |
| 5th row | 2021 |
Common Values
| Value | Count | Frequency (%) |
| 2021 | 7344 | |
| 2022 | 1416 | 16.2% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 2021 | 7344 | |
| 2022 | 1416 | 16.2% |
holiday
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.7 KiB |
| Value | Count | Frequency (%) |
| False | 8568 | |
| True | 192 | 2.2% |
dayofweek_n
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.991780822 |
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 1272 |
| Zeros (%) | 14.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.00351923 |
|---|---|
| Coefficient of variation (CV) | 0.6696744679 |
| Kurtosis | -1.252932851 |
| Mean | 2.991780822 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.003108904696 |
| Sum | 26208 |
| Variance | 4.014089305 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=7)
| Value | Count | Frequency (%) |
| 0 | 1272 | |
| 1 | 1248 | |
| 2 | 1248 | |
| 3 | 1248 | |
| 4 | 1248 | |
| 5 | 1248 | |
| 6 | 1248 |
| Value | Count | Frequency (%) |
| 0 | 1272 | |
| 1 | 1248 | |
| 2 | 1248 | |
| 3 | 1248 | |
| 4 | 1248 | |
| 5 | 1248 | |
| 6 | 1248 |
| Value | Count | Frequency (%) |
| 6 | 1248 | |
| 5 | 1248 | |
| 4 | 1248 | |
| 3 | 1248 | |
| 2 | 1248 | |
| 1 | 1248 | |
| 0 | 1272 |
dayofweek
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.6 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.139726027 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Monday |
|---|---|
| 2nd row | Monday |
| 3rd row | Monday |
| 4th row | Monday |
| 5th row | Monday |
Common Values
| Value | Count | Frequency (%) |
| Monday | 1272 | |
| Tuesday | 1248 | |
| Wednesday | 1248 | |
| Thursday | 1248 | |
| Friday | 1248 | |
| Saturday | 1248 | |
| Sunday | 1248 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| monday | 1272 | |
| tuesday | 1248 | |
| wednesday | 1248 | |
| thursday | 1248 | |
| friday | 1248 | |
| saturday | 1248 | |
| sunday | 1248 |
working_day
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.7 KiB |
| Value | Count | Frequency (%) |
| True | 6120 | |
| False | 2640 |
season
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.6 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Winter |
|---|---|
| 2nd row | Winter |
| 3rd row | Winter |
| 4th row | Winter |
| 5th row | Winter |
Common Values
| Value | Count | Frequency (%) |
| Summer | 2256 | |
| Spring | 2208 | |
| Winter | 2160 | |
| Autumn | 2136 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| summer | 2256 | |
| spring | 2208 | |
| winter | 2160 | |
| autumn | 2136 |
peak
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.7 KiB |
| Value | Count | Frequency (%) |
| False | 6210 | |
| True | 2550 |
timesofday
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.6 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.833333333 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night |
|---|---|
| 2nd row | Night |
| 3rd row | Night |
| 4th row | Night |
| 5th row | Night |
Common Values
| Value | Count | Frequency (%) |
| Night | 2920 | |
| Afternoon | 2190 | |
| Morning | 1825 | |
| Evening | 1825 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| night | 2920 | |
| afternoon | 2190 | |
| morning | 1825 | |
| evening | 1825 |
rainfall_intensity
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.6 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.257876712 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | no rain |
|---|---|
| 2nd row | no rain |
| 3rd row | no rain |
| 4th row | no rain |
| 5th row | no rain |
Common Values
| Value | Count | Frequency (%) |
| no rain | 7862 | |
| drizzle | 465 | 5.3% |
| moderate rain | 320 | 3.7% |
| light rain | 100 | 1.1% |
| heavy rain | 13 | 0.1% |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| rain | 8295 | |
| no | 7862 | |
| drizzle | 465 | 2.7% |
| moderate | 320 | 1.9% |
| light | 100 | 0.6% |
| heavy | 13 | 0.1% |
wind_bft
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.987214612 |
| Minimum | 1 |
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.048028504 |
|---|---|
| Coefficient of variation (CV) | 0.3508380347 |
| Kurtosis | 0.3763836352 |
| Mean | 2.987214612 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.604645669 |
| Sum | 26168 |
| Variance | 1.098363744 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=8)
| Value | Count | Frequency (%) |
| 3 | 3026 | |
| 2 | 2875 | |
| 4 | 1872 | |
| 5 | 529 | 6.0% |
| 1 | 314 | 3.6% |
| 6 | 117 | 1.3% |
| 7 | 25 | 0.3% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 314 | 3.6% |
| 2 | 2875 | |
| 3 | 3026 | |
| 4 | 1872 | |
| 5 | 529 | 6.0% |
| 6 | 117 | 1.3% |
| 7 | 25 | 0.3% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 2 | < 0.1% |
| 7 | 25 | 0.3% |
| 6 | 117 | 1.3% |
| 5 | 529 | 6.0% |
| 4 | 1872 | |
| 3 | 3026 | |
| 2 | 2875 | |
| 1 | 314 | 3.6% |
count
| Distinct | 25 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.780707763 |
| Minimum | 0 |
| Maximum | 26 |
| Zeros | 1794 |
| Zeros (%) | 20.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.619783957 |
|---|---|
| Coefficient of variation (CV) | 0.9574355344 |
| Kurtosis | 1.395419616 |
| Mean | 3.780707763 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.167339767 |
| Sum | 33119 |
| Variance | 13.1028359 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=25)
| Value | Count | Frequency (%) |
| 0 | 1794 | |
| 1 | 1229 | |
| 2 | 1000 | |
| 3 | 898 | |
| 4 | 775 | |
| 5 | 645 | 7.4% |
| 6 | 597 | 6.8% |
| 7 | 471 | 5.4% |
| 8 | 368 | 4.2% |
| 9 | 274 | 3.1% |
| Other values (15) | 709 | 8.1% |
| Value | Count | Frequency (%) |
| 0 | 1794 | |
| 1 | 1229 | |
| 2 | 1000 | |
| 3 | 898 | |
| 4 | 775 | |
| 5 | 645 | 7.4% |
| 6 | 597 | 6.8% |
| 7 | 471 | 5.4% |
| 8 | 368 | 4.2% |
| 9 | 274 | 3.1% |
| Value | Count | Frequency (%) |
| 26 | 1 | < 0.1% |
| 24 | 2 | < 0.1% |
| 23 | 1 | < 0.1% |
| 21 | 1 | < 0.1% |
| 20 | 5 | 0.1% |
| 19 | 8 | 0.1% |
| 18 | 8 | 0.1% |
| 17 | 16 | |
| 16 | 19 | |
| 15 | 32 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
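The rank-based calculation described above can be sketched in plain Python. This is an illustrative implementation, not the code the report itself runs: the helper names (`average_ranks`, `pearson`, `spearman`) are hypothetical, and tied values are assumed to receive the average of the ranks they span.

```python
import math

def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman(xs, ys):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson(average_ranks(xs), average_ranks(ys))

# A monotonic but nonlinear relationship: y = x squared.
xs = [1, 2, 3, 4, 5]
ys = [x * x for x in xs]
print(spearman(xs, ys), pearson(xs, ys))
```

On this toy data ρ reaches its maximum while r stays below it, which is the distinction the paragraph above draws. A constant column has zero variance and would cause a division by zero in this sketch.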
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
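The definition and the invariance property can both be checked with a short sketch. The function name `pearson_r` and the toy data are illustrative assumptions, not values from this dataset.

```python
import math

def pearson_r(xs, ys):
    """Sample covariance of X and Y divided by the product of their sample
    standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    return cov / (std_x * std_y)

# Toy data: roughly linear with a little noise.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(xs, ys)

# Shift and rescale each variable separately (positive scale factors).
r_rescaled = pearson_r([10 * x + 3 for x in xs], [0.5 * y - 7 for y in ys])

print(r, r_rescaled)
```

The two printed values agree up to floating-point error. Note that the invariance holds for positive scale factors; multiplying one variable by a negative number flips the sign of r.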
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
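The pair-counting definition above corresponds to the variant usually called tau-a. A minimal sketch follows; the function name `kendall_tau_a` is hypothetical, and tied pairs are counted as neither concordant nor discordant. Library implementations commonly report the tie-adjusted tau-b variant instead, so results may differ when ties are present.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        direction = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Four observations give six pairs; only the middle two are out of order.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

Here five pairs are concordant and one is discordant, so τ = (5 − 1) / 6. The quadratic loop over pairs is fine for illustration but slow for a column of 8760 observations.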
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| | rain | temp | rhum | wdsp | date | hour | day | month | year | holiday | dayofweek_n | dayofweek | working_day | season | peak | timesofday | rainfall_intensity | wind_bft | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.0 | 0.1 | 98 | 4 | 2021-03-01 | 0 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Night | no rain | 2 | 0 |
| 1 | 0.0 | -1.1 | 98 | 3 | 2021-03-01 | 1 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Night | no rain | 2 | 0 |
| 2 | 0.0 | -1.2 | 98 | 4 | 2021-03-01 | 2 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Night | no rain | 2 | 1 |
| 3 | 0.0 | -0.9 | 100 | 5 | 2021-03-01 | 3 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Night | no rain | 2 | 0 |
| 4 | 0.0 | 0.0 | 100 | 6 | 2021-03-01 | 4 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Night | no rain | 2 | 0 |
| 5 | 0.0 | 2.4 | 98 | 6 | 2021-03-01 | 5 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | False | Night | no rain | 2 | 0 |
| 6 | 0.0 | 2.4 | 98 | 6 | 2021-03-01 | 6 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Night | no rain | 2 | 0 |
| 7 | 0.0 | 2.1 | 100 | 4 | 2021-03-01 | 7 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Morning | no rain | 2 | 3 |
| 8 | 0.0 | 5.1 | 98 | 5 | 2021-03-01 | 8 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Morning | no rain | 2 | 1 |
| 9 | 0.0 | 5.7 | 98 | 5 | 2021-03-01 | 9 | 1 | 3 | 2021 | False | 0 | Monday | True | Winter | True | Morning | no rain | 2 | 4 |
Last rows
| | rain | temp | rhum | wdsp | date | hour | day | month | year | holiday | dayofweek_n | dayofweek | working_day | season | peak | timesofday | rainfall_intensity | wind_bft | count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8750 | 0.2 | 7.9 | 83 | 9 | 2022-02-28 | 14 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | False | Afternoon | drizzle | 3 | 0 |
| 8751 | 0.9 | 7.4 | 90 | 8 | 2022-02-28 | 15 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | True | Afternoon | moderate rain | 3 | 0 |
| 8752 | 0.0 | 8.3 | 81 | 4 | 2022-02-28 | 16 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | True | Afternoon | no rain | 2 | 0 |
| 8753 | 0.0 | 8.0 | 75 | 8 | 2022-02-28 | 17 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | True | Afternoon | no rain | 3 | 0 |
| 8754 | 0.0 | 4.5 | 81 | 9 | 2022-02-28 | 18 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | True | Evening | no rain | 3 | 0 |
| 8755 | 0.0 | 2.5 | 86 | 6 | 2022-02-28 | 19 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | True | Evening | no rain | 2 | 0 |
| 8756 | 0.0 | 2.2 | 86 | 7 | 2022-02-28 | 20 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | False | Evening | no rain | 3 | 0 |
| 8757 | 0.0 | 1.1 | 90 | 5 | 2022-02-28 | 21 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | False | Evening | no rain | 2 | 0 |
| 8758 | 0.0 | 0.0 | 94 | 6 | 2022-02-28 | 22 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | False | Evening | no rain | 2 | 0 |
| 8759 | 0.0 | 0.2 | 94 | 6 | 2022-02-28 | 23 | 28 | 2 | 2022 | False | 0 | Monday | True | Winter | False | Night | no rain | 2 | 0 |